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TW. f*In«-up tut] t 

?C7C3 I S0 

DCVCjTA 1 CC3CCGCCC3 CCCCCACC7C CCACCCTCCA CACCCCCCCC SO 

MUCSLVTNC I --- - > 3 

51 - 100 

30VC3TA Si CC3CTCCC7C CCCCCCCTCC CCCCCCCCCC GCACCACCSC CCCCCCCCCS iOO 

iWSCLTTHa 51 1,00 



D0UC37A 101 CCCCACCCCA GCGCACCCCC ACACCCCCCC CCCCCAC™ ISO 

KU3CLra3 101 " 150 

LSI - 

3QV0S7A IS I OCCAQOACOC ACCCCCCCCA CTCT7CCCCC ACZCCACCAC CCC^TC^CCC > 00 

>«5CLW3 IS! — 203 

rcrcD 201 — - - - 

30VC3TA 201 ACCCCACCca CZQCCZXOCZ CCCACCCCGC CCACCTTCTQ CCCATCACC3 >SQ 

HUKUTOS 201 CC TCTTACGACa CTCCACA77C 7CGG7COAGC CC7AGCCCT3 ISO 

?TO3 2SI 

3QVQS7A 2 SI AAACCACGTO 7CC7CVAC7Q QCCAGCCACC TCTCCCCAAQ ACCAAC77GC 2 00 

WJKLTTWS • 251 CCTT7TCTTA QC7GCC7GAC ACCTTCCCTT G7ACAC7C7T CT7CCAATCA 3 00 

PCTC3 301 - — 350 

DQVGS7A 201 C7CGCATT75 CACCCAAACA CCACACACT7 CACAAAATCA ACGCAQ7CA0 250 

201 CAACTACCCA TTCTOCTGAA GACC7CGCCC 7C7CACCC7C ^CC.\CTTC<] 2 50 

? arco 2 si ccTcccTccr tctccagvcc wo 

30VC3TA 251 XAGCC7CCAC CT7CCC7TCC TCCC— AGC CC7CCC7C77 7C7CCACAAC 00 

WJSCOTNS 151 AACCCTC7AC C77CC777CC 7CTCC7CACC CCTKCTCCT TC=CACCCC 100 

fCTCD €01 ACAGC7CAC7 ACAACTT-OT 77C- GCC 7777ACTCTC OCCCCACACA *50 

S0VGSTA 401 CCACC7CAC7 ACAAC77-CC TACT7TT0 CC T777AC7C7A GCAGCACACA SO 

MU5CLm*S 401 ACACC7CCAC ACAACCTC0C TTCCTTTCC7 C777CC77TC CACCCAACAC 450 

PC7CD 4S1 AGCACACCA7 CAG " 

DOVCSTA. 451 ACCACACCA7 CAC ^00 

WJGCLmiC 451 ACCXACCAT CACCCTOCT TTCAACTCAA CACA7C7CC7 7ACCCCAG7C soa 



PCTC3 
30VC5TA 



?CTCD 

DCVC3TA 

HUSGLYTlIS 



poto 

3CVCS7A 



PC7C3 
DCVCS7A 



DCVC37A 




3 ! £*.r 



601 
501 



-CACAAAA 



-CACAAAA 



501 CAAC7CACAA AOACTCTTC CATCCTCAAC ATCTCCATCA CACCACAAAA 



551 

551 
651 



701 
701 
701 



7<%k7Cf\ATC7 

TMvTcyvA?cr; 



GTAATCC7TQ 
CTCA7TCT7G 
G7CC7TG7CC 



CAAACCAACA C7CC7TC7CT CAA7CC7CC7 
CAAACCAAAA C7CAT*7C7C7 CAA7GCTCC7 
CAACCCAAAA CTXATCC7C7 7CA7CC7CA? 

7C7TT7CCCA A7ACATCAAC ACCCCACAAC 
TG7TTTCCGA ATATA7GCAC ACCCCACAAC 
7G77T7QCGA ATATCTCAAC AGCCCACACG 



TCTCTCAACT 
7G7C7CAAC7 
TCrCTCAACC 




751 
751 



C7GCA7XTAC 
CTGCATAAAC 
CTCCA7A7A7 



301 
301 
301 



CC7GG7CG-- 
ACCCCTCwTG 
AGCAC7GGTG 



CAC7CAAAAA ACCCACAAC7 7CGCVGCAC7 
CCATCAACAA ACCCACAAC7 7CC7CCCAGC 
CACACAAAAA TTCCACACC7 TCC7CACAAC 

ELUrl'o ^ Gift* T 

-tttccgagc tcgtttaaca atcgcactca 

CC7TCCCACA TCCTTTAACA A7Q 

GTTCCCAACC 7CC7T7AAAA A7CCCACCCA 



CC7CACACCC 
ACCA77CACA 
AGA7GQCAGA 



CACTTACCAC 
- -CTTACCA7 
CAC77ATCAA 



630 
5SQ 



730 
700 

7oa 



750 
750 
7S0 



3Q0 
300 



350 
350 
350 



PC7C0 
BOVCwTA 



PG7CD 

D0VGS7A 

MUCGLYTNS 



(3 51 CAACAACAAC 
Q SI CAACAACATG 

a 51 gaacacaacc 



301 OVCACCACAC 

901 AACCAAG 

901 — TCAACAC 



ACCCTATACC CAACCAAAAG CAACAAACAA AACAACACAA 
GACACATAAA CGAACAAAAC QAACAAACAA ACCAACACCA 
TACAACCACG CACACAAAAQ CG7ACAAA7G CACATCCCA7 

CT7CCGCTAQ TGGAC7CC77 TAA7CC7GAC AAACCCCCAC 
CT7AACCTAT CGGACTCCT7 CAACCCATTT AAACaCCCCG 
CC7CACCTAT QCCAC7CCTT CAATCCAAAQ AACCCCCCCC 



900 
900 
900 



950 
950 
950 



PG7CD 951 ACCCTCTGAC CATAACCACA TCCAAGKTC CAGTCGTATC GCAAGGCACT 

30VCSTA 951 ACGTTGTGAC CATGACCAAC TCCAACCCTC CAGTGC7C7C CCAACCCACT 

WUSCLTTUS 351 ATC7TTTCAC ACTCACCCCG TCCAACCGGC CCA7TGTG7G CCAACCCACT 



100Q 
1000 
10QQ 



PC7CD 

BOVGSTA 

MUSOLYTMS 



PC7CD 

30VCSTA 

WUSGLTTNS 



1001 TACAACACAa CCCTCTTACA 
1001 TACAACACAC CCGTCTTACA 
1001 TATCACACAC CTCTCCTGCA 



10 SI CGCCT7CACC CTTTTTCCTa 
1051 CGCCC7CACC CTTTTCCCCC 
1051 GGCGCTCACA C7CTTTCCTG 



7AATTATAA? CCCAAACACA 
CAATTA7TA? GCCAACCACA 
AAAC7AC7AC CCCACACACA 

TCCCAACATA CA7TGACCA7 
TCCCAACATA CATTCACCA7 
TCCCAAACTA CATTCACCA7 



AAA77ACCGT 
AAATTACCC? 
AACTCACTC7 



TACTTCCACC 
TACTTCCACC 
7AC77ACAAG 



1050 
1050 
1050 



1100 

a oa 

1140 



?G7CD 1101 AGTTCT7AA? ATCTCCAAAT 1 

DCVCSTA 1101 ACTTCTTAAC C7CTCCTAAT 

WJ3CLYTNS HOI ACTTTCTCCA CTCTCCTCAC 



ACATACTTCA TCCTTCGCCA CAAACTCATC 1150 
AACCACT7CA 7CCTGGCCCA CCC\CTCATC 1150 
A7C7ACT7CA TCCTTCCCCA TCCCCTCA7A 1150 



PCTCC 1151 TTTTACATCA TCC7CCATCA 

0OVCSTA 1151 TT7TA7A7CA TCCTACA7CA 

M'JSCLYTNS 1151 TTT7ACCTCA TCATACATCA 



TATCTCCACC ATCCCTTTCA TAGACCTCCC 1200 
TCTC7CCAGC ATCCCTTTCA TACACTTCCC 12 CO 
CACC7CCCCC A7GCG7CTCG TGCACCTCM 120Q 



PCTC0 1201 TCC7CTCCGC TCC7TTAAAC 

UCY^STA 1201 TCCTC7CCGC TCCTTCAAAC 

MUrCLTTHS U01 CCC7CTACA7 TCC7TACAAC 



7C7TTCACAT CAAC7CCCAC A.\GACGTCCC 1250 
TGTTTAACAT CA,\CCCTCAC AACACCTGCG U50 
7C7T7GACA7 CACC7C7GAC AACACG7CCC IjSO 



4 0 



V 



PCTC3 



PCTCD 
DCVCS7A 



125 L AACACATCAC CATCA7CCCC ATCAACACCA TCCCCCACCA CATC.CCCC 
12 51 ACCACATCAC CATCATCCCC ATCAACACTA TCCCCCACCA CATTGTCCCC 
1251 ACCATA7CAC CATCATCCCC ATCAAGACCA TTOGGGACCA CATCC7CCCC 

1301 CACATCCACC ACCACCTCCA CTTCCTCTTC 7CCATCCACC TCCATCACCT 
1301 CACATCCACC A7CACC77CA CTTCCTTTTC TC CATC CATC TCCACCACC7 

1301 CACATCCACC accacctcca cttcctcttc TCCATCCACC tccatcaact 



uoo 

1300 
13 OQ 



1350 
1350 
13 50 



PCTCD 
DGVCSTA 

Hucc-Lras 



13 51 CTTCCAAAAC AACTTTCCCC TCCACACCCT CCCCCAGTCC CTCXX.CACC 
US1 CTTCCAACAC AACTTTCCCC TCCX0ACCC7 CCCCCACTCC CTCCCCCACC 
12 51 C7T7CAACAC AACTTCCCC3 TCCAAACTCT CCCCOVX.C CTACCACACC 



1400 
1400 
1400 



PCTCD 

BOVCSTA 

MUSCLYTNS 



HQ' TACACCCCTO C7GCTACAAC CCACATCCT3 ACCXCTTCAC CTACCACACC 
UC1 TACAACCCTG CTCCTACAAC CCACATCCCA ATGACTTCAC CTACCACACC 
i ;c TCCACCCCTG C7CCTACAAC CCCACTCCCC ACAACTTCAC CTATCACACC 



1450 
USO 
USO 



PCTCD 

DOVCSTA 

HUCCLY7HS 



U51 CCCAACCAC7 CCCCACCCTA CATTCCCTTT CCCCACCCCC Ai \**A 

US1 CCCAACCACT CTCCACCATA CATTCCCTTC CCCCAACCCC A.*^7A7TA 
1151 CCCCAACTCT CCCCCCCCTA CATTCCATTC CCACACCCCC ATTTTTACTA 



1500 
1S0Q 
1500 



PCTCD 
BQVGSTA 



ISai CCACCCACCC AT7TT7CCCG GAACACCCAC TCACCTTCTA AACATCACTC 
1501 CCATCCACCC ATTTTTCCCC GAACACCCAC TCACGTCCTT AACATCACCC 
1S01 CCACCCCCCC ATTTTTCCAC CAACCCCTAC TCACATTCTC AACC7CACCA 



1550 
1550 
1550 



PGTCQ 

DQVGOTA 

MUCCLYTUS 



I551 ACGACTaCTT CAAGCGAATC CTCCAGCACA ACGAAAATGA CATACAAGCC 
IS 51 ACCAA7CCTT caaagcaatc ctcaaccaca acaaaaatca catacaaccc 

1SS1 CGGAGTCCTT TAACCCCATC CTCCACCACA ACAAACATCA CATACAACCC 



1500 
1500 
1500 



PCTCD 
30VCS7A 

musclyths 



1501 CACTCCCATC ATCAAACCCA TCTAAACAAC TATTTAATTC TCAACAAACC 
lfiOl CAATCCCA7C ATCAAACCCA TCTAAACAAC TAT7TCCTTC TCAACAAACC 
ISO! CACTCCCATC ATCACACCCA CC7CAACAAA TACTTCCTTT TCAACAAACC 



1550 
1550 
1550 



PCTCD 
DQVCC7A 

musglytws 



LSSl CACTAAAATC TTATCCCCAC AATACTCCTG GCA» tATCAT ATACCCATCT 
15 51 TACTAAAATC TTATCCCCGG AATACTGCTG GCATTATCAC ATACCCCTAC 
1551 CACTAAAATC CTATCTCCAO AGTATTCCTC CCACTATCAC ATACCCCTCC 



1700 
1700 
1700 



PCTtD 
BOVCSTA 



PCTCD 

DOVCSTA 

MUSGLTOT 



17 0L C7CTCCATAT TACCXTTGTC AACATACCTT CCCACAAAAA ACACTATAAT 
17Q1 CTCCGCATAT TAAOCTTCTC AACATGTCTT CCCACACAAA ACACTATAAT 
17 01 CTTCACATAT TAAAACTC7C AACCTACCT7 CCCACACAAA AQAGTATAAT 

17 51 TTCCTTACAA A7MCATc|reJbT?XM77 CTCCCACCXa TTTTCTCAAT 

17 51 GTCCTTACAA ATAATCTcjlC /fcTT T CTCCCACTAC ATTTCTCAAT 

17 51 TTCCTTACAA ATAATCTUTC rirrTCAMTT GTO -ATCGAAAC 



1750 
17 50 
17 50 



1300 
13CQ 
1300 



PCTCD 

SCVCSTA 

kvscltos 



13 01 T7CAAACAGT ATTACTCTGG CTACTTCCTC ACACAACTAC -~ CACTTAA 

*3Q1 TTCACACaCT ATTATTCTCC CTACTTCCTC ACAAAACTAA CACTTAA 

1101 TTCACAC--T ATTACTCTCC CTAAT7CCTC AAACAAQTAC CAACACTTCA 



1Q50 
1ES0 
1E50 



P0TC3 

sgycsta 



PCTCD 

acvcrrA 

MV5C:.rTN9 



pctcd 

BQVCS7A 



PCTCD 

BCVCS7A 

HUS0LT7N3 



PGTC3 
0CVCS7A 



PCTCD 
BC-YOSTA 



pctco 

BOVCJTA, 

hu scorns 



PCTCD 
SOVC3TA 



?CTC3 
OQVCSTA 



PCTCD 

DOVCSTX 

WJSCLYTNS 



DCVQS7A 



WTO 

BOVQSTX 

HUKL'fTHS 



sovcsta 

HUSCLTTNS 



III SI T7TTAACT77 TAAAAAAATA CTAACAAAA TACCAA CACACTAA-C .1300 

13 S; 7TTTAACTTA AAAAAAAATA. CTAACAAAA OACCAA CACACCAA-A 13C0 

13 SI TTTCAACTTT TAAAACAA-A CAATCAAAAC CAAAACCCAC TACCAiwCCA 1300 

1901 TACA7ATTAT TCTTCCTTCC AACTTTCACC CT7CTCAAAT CCCACAATCA 1950 

1301 TACATATTA7 TTCTCCTTCT AACTTTCACC CTTCTAATAC CCCACAATCA 1JS0 

1301 AACACATCAT TTCTCCT-CA CACCTTCACC C7-CTAATA7 CTCACAAACA J>50 

13 51 CTCraTCC TAATCACA TC7AAAT7CC CAC7CATTTC 2000 

13 SI ACCTCTCC-- --7AA7CACA rCTAAATTCC CACTCATTTC TTACCTATTT 2QQQ 

13 31 CTC7XTCGCA AC7AATOCC TATAAAT7CT CAAT0A7T7C TTXTATATTC 20QQ 

^ aai - ' :oso 

3001 TTCCTTCTCa CCCC«GCAA TCCA7ACACC ATCAQTTGAA CC 1050 

2001 TCCCTC7TC3 QAAAACTTCA TTC7ACAAA7 CAAAATTAAT TTCACAAACC 2Q50 

2100 

\v. uqq 

aaaaccacat gccgcaaact tcttcccag4 ctgtcataca attcaccact 21Q0 

2130 

ai« 2isa 

:l01 OCCCACcicC TCACACAACC ATTAcicAAC ACTC7CCCTT GTOTCACACT 2150 

2200 

2i5i • "!!!!!!!! 2:0a 

2151 TCCACCCCTC C^TCCCTTTa CCTTCATTAT CT7CCTCCTC ATCCACATTC 2200 

2250 

2201 \\ \ 2250 

'All T^OCAACC CACACACCCT TOCACCCAC ACACC^lxA TAACCATCCC 2250 

2300 

i2si 23 oa 

fJl AATCTCACCA TCACTCTCTA AAACCTCAtG CCTCCAC^a cccTccact =300 

3350 

2301 * !!!!"""! *. ^ 5a 

*1* *1 'lllll^r TCCCCATAAA CCAC3CACAC 3350 

2301 CCACCCCCCA C7AGQC7AT7 OTTTCTCTCC TCCCCATJV^ 

240O 

23S1 2i00 

™<*^ :t0 ° 

2450 

210L * ' - 2450 

1«1 £^ X^ACT^ »50 

2S0O 

H$l 2SO0 

WW kzzzz 1500 




aovcrrx 



PC7CD 

DCVCC7A 

XUSCLYTHS 



acvcrni 



?C7CD 
KU53W7113 



PCTC3 

2CVGSTA 

KUSCLTTHS 



pcrrca 

KUSCLTTTiS 



PCTCO 

J0VC3TX 

MUSCLYTNE 



PCTC3 
30VCSTA 



?CTCD 
D0VCS7A 



PCTCD 

BOVCSTA 

HVSCUfTHC 



PCTCD 
HUS0U7IU 



2Soi :sso 

2SQI 1550 

2 SOI TCCAAACT07 CTTCATTTTC TITTCCCCAA XTTC7ATTCA TCC7CTTACC IS SO 

25SL :sao 

3S5i 2saa 

2 ssi. rrrcTcrcTT cacccctctc cagaccwCt cactct;atca cccaaccac* isao 

3501 

2601 t 26SO 

2 COL C7ACC7CACC CCACTCACCA. CCAOCACCC? XrrATATCVC 2633 

z«sl irao 

:<sl -700 

2 ssi ctcatcatca qc7cctacc7 acaacctcct ctcaacctcc cac^tcctca 2700 

2701 , 27S0 

2701 • 2750 

2701 occcATcarc ttccactctq 0CCCCCTC7A TCCACCACC? CACTCAACAC 27 so 

27S1 2300 

2751 2J0O 

2751 AAA0CCCCC7 QTCACATCAC CTCMTTCCC C7CC7CTAC7 ACTA70CAAC 2300 

2801 2350 

2001 2350 

ia01 TCTCACACCC MCCAOCCAO ATCTACTCCA CAACATACCA ACCCACTTTA 2*50^ 

2851 :30 ° 

2351 2300 

2B51 TGCCAATCCG AOCCGCACTC ACTACAACCC ACCTGCTCAA CCTTCrCTTC 2300 

IJ01 1350 

2301 

2301 CCCCCTCTCA CACCCTCCAG CACCCCCTOT ATACC7CC7T CTCAACCTA7 23 50 

2«l 

2551 

29S1 CCCTCXCAC CCCTTTCCCA AGTCTTAAAT GACCCTTT2A CACC7CTCCC 2000 

™ mi 

3001 

J00L CTAACXCCC7 TXAAAAACAT XGATATTTCC ACTC7CACTC GTAACAOTAO 3QSQ 



PCTCD 
DOVOCTA 



3051 

30S1 

30 SI CACAA7TACA GTTATGAAAT AOCAACCCAA ATAATTCTCS CC7TCCTCTC 



3100 
3100 
3100 



PDH Line-up (aa) : 

PGTfrrasa 1] 3- HNVKCXWLS HLLVS7VKW PVEVIMSrEC SITWIYGSXN PEVC-uSAQR SO 

3G?'--aac II 1* HNVKCKVI^S MLWS7VIW 7WEYIJIS?EG SlTWIMPSaN PEVCCSSIQ1C SO 

MC7[7rano 1] 1 MNVKGXVIL.L- MLIYSTVWV FHEYV1ISPDG SJLWIYHTXI PEVCZNRWQK SO 

?GT[--4ffia II SI CWWr PSWFNN cniSYKEZED EnNRC^PLV CWFtf?ESl?S 100 

3C"^'-3n« 1] 5 I CWWLPKWFNN G YHEZDC DINEEXEGRN E3- E-X-XLS CWF^PTICIPE 100 

MO^r*«a« U 31 DWHFPSWFXM CTHGYQEDWV EGnaSR-CHN CSJUESPQW OW?NPWIRPO 100 

PC-[-'aac 1] 101 AVTZTKW1C\? VVWECTTNftA VWNYNAXQK IT/CLTVFAV CRYIEIIYLEZ 15 Q 

aG-[^-isa 11 *01 WTKTXWKA? VWECTWRA VLDNYYAXCK iTVCLrVFAV CRYZ — tYLEH ISO 

HC?l7r*M 1] 101 VL7VTPWKA? IVWECTYDTX LLEXYTWTCK LTVC LTVFAV CXYIEHYLI3 ISO 

prf:-^n 11 131 FLISAtfTYni VCIBWIPTIH VOOISRilPLI ll*C?-aw"XV FSIXSEXRKQ 2QQ 

b<^ i isi FLTSA^ini vcupviptih vcavsnMPt: swpuisfxv ?x:x?Exiwq 200 

KC?I?t1w 11 151 FLS5ABKYFM VCHnVIFT/M IDDT5M4PW KLHPLHSLQV FEIRSEXRWQ 200 

?(7(?Wiaa U 201 DISKHSMK7I GZHILAJIIQ1I EVDFWCMDV WVFQKNFCV STLCQSVAQL 2S0 

BCTt^c 1 201 DISWKRMKrr CEHIVAHIQH ETDFUOfflV DQVFQDXFGV KLCZffVXQL 2 0 

KC?C7r MC 11 201 OISMMBHKTI GEHILAHIQK EVDFLFCHDV DQVFQDNFGV ETLCQLVAQI* 250 

PCTC^aaa 11 251 QANWYXAHPD EFTYEJIRXES AAYIPFCQC3 PYYliAAI FCC T?TQVUNITQ 300 

DC? 1 WHTWPU DfTTEOTZS MTI?F«W FYYHAAS7CG QO 

SSl7™a 11 251 QAWHYKASPE TOWELS AAYIPFCECD FYYHAAIFGG TPTHxLSLTR 300 

PGri-raM U 301 ECFKGILQ0K EOTIEM3M0 ESHLNKYIjIL NWTXILGPS YOtTC-ClGSS 350 

: 1 301 ECFXOILKK KKDIEAQWHD ESHLHKVFW. HJSFTXILSPE YWDttllCW JSC 

MCT -Sc I 301 ECFJCGILQK XKDXSWHIID ESIILNWrtT NXPTKIwSrE YCWKQKL? 3SO 
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V EcoRV 
X Xbal 
Xh Xhol 



No sites for Nde, Pvul, Xho, Kpn, Saclt, Sma, Cla, Apa, Not 
Unmapped sites for Pst, PvuII 

pUBS: 

.... Sad SacII Not Xba Spe Bam Sma Pst Eco RV Hind Cla Sal Xho Apa Kpn 
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non-coding DNA 
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Neo resistance gene 
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Amp resistance gene 




a 1,3 gahcto transferase (aGT) 
non-coding DNA 

Exon9ofaGT 
Neo resistance gene 




Bacterial origin of replication 



Amp resistance gene 



# 



J2. 

o 



C2 — 



-2- _ 

= o ~ ~ ~ 
« u M re re 
a ua Q oo oo 

c w o oo oo 




00 



oo 



00 — 1 



CJ 

CD 

u 

H 



p 1 1 



< 
D 

Ml 

.s 

•o 

8 



o 
o 



o 

< 

D 

s 
8 



o 



E 

o 



B 



< 

2 
D 



I I 0 I 1 I 







1 HLnd 



HindVSca 
Bin* 



Sma digested vector 

Insert digested wiih Sal and filled 




E3 aL J gilacio transferase (aGT) conceding 

B Exon 9 of aGT 

■1 Nco resistance gene 

E3 rlH" recombination site 

S Bacterial origin of replication 

E23 Ajnp resistance gene 
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10 20 30 40 50 50 

GAGGCCTSCA GCAATTCGAT GATCCCCCAG CTTGAACTTC CTATTCCGAA CTTCCTATrC 

70 30 90 100 110 120 

TCTAGAAAGT ATAGCAACTT CAAGCTGCGC TGCAGGAATT CGATTCCAGC ACTCTGCT7T 

130 1-40 150 ISO 170 130 

TGCAACAGGA ACCAAAAACC CTCTC CACCC ACGCCTCGAA TGTTTCCACC CAATG* 



130 200 210 220 230 340 

CAGTCTGGTT TTCCAAGACG AACCAAAAAG CCTCTCCACC CAGGCGTGGA ATGTTTCCAC 

350 250 270 230 2S0 300 

CCAATCTCGA GCAAAC2GCG CCCACCGTCT 7GTCATTGGC CAAT7CGAAC ACGCAGATGC 

310 320 330 340 350 2 50 

AGTCGCGGCG GCGCGCTCCC AGGTCCACTT GGCATATTAA GGTGACGCGT GTGGCCTCGA 

370 330 390 400 410 420 

ACACCGAGCG ACCCTOCACC CAA7ATGGGA TCGGCCATTG AACAAGACGG ATTCCACGCA 

430 440 450 460 470 430 

GGTTCTCCGG CCGCTTGCGT GGAGAGGCTA TTCGGCTATG ACTGGGCACA ACACACAATC 

49Q 500 510 520 530 540 

GGCTGCTCTG ATGCCGCCGT GTTCCGGCTG TCAGCGCAGG GGCGCCCGGT TGTTTTTGTC 

550 550 570 530 530 500 

AAGACCGACC TGTCCGGTGC CCTGAATOAA CTCCAAGACG AGGCAGCGCG GCTATCGTGG 

S10 620 630 640 650 560 

CTGGCCACGA CGGGCGTTCC TTGCGCAGCT GTGCTCGACG TTGTCACTGA ACCGGGAAGG 

S7Q 630 690 700 710 720 

GACTGGCTSC TATTGGGCGA AGTGCCGGGG CAGGATCTCC TGTCATCTCA CCTTGCTCCT 

730 740 750 750 770 730 

GCCGAGAAAC TATCCATCAT GGCTGATGCA ATGCGGCGGC TSCATACGCT TGATC2C 



790 300 310 320 330 340 

ACCTGCCCAT TCGACCACCA AGCGAAACAT CGCATCGAGC GAGCACCTAC TCGCATGGAA 

350 360 370 880 390 900 

GCCGGTCTTG TCGATCAGGA TGATCTGGAC GAAGAGCATC CCCACCCGAA 

910 920 930 940 950 950 

CTGTTCGCCA GCCTCAACCC GCGGATGCCC GACGGCCAGG ATCTCGTCGT GACCCATCGC 

970 930 990 1000 1010 1020 

GATGCCTGCT TGCCGAATA7 CATGGTGGAA AATGGCCGCT TTTCTGGATT CATCGAC7G7 

1Q30 1040 1050 1050 1070 1030 

GGCCCGCVZC GTGTCGCGGA CCGCTATCAC GACATAGCGT TCGCTACCCG TGATATTGCT . 

1090 1100 1110 1120 1130 1140 

GAAGAGCTTG GCGGCGAATG GGCTCACCCC TTCCTCGTGC TTTACGGTAT CGCCGCTCCC 

1150 1150 1170 1130 1150 1200 

GATTCGCAGC GCATCGCCTT CTATCGCCTT CTTGACGAGT TCTTCTGAGG GGATCGGCAA 

1210 1220 1230 1240 1250 1250 

TAAAAAGACA GAATAAAACG CACGGGTGTT GGGCGTTTGT TCGGATCATC AAGCTTGAAG 



1270 1230 1290 1300 1310 1320 

TTCCTATTCC GAAGTTCCTA TTCTCTAGAA ACTATACGAA CTTCAAGCTT ATCGATGAGT 

1330 1340 1350 1360 1370 1330 
AGATCTTGAT CGATACCGTG 
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r -infer sequences : 0-23 
FRT : 29-104 

Polyoma virus enhancer repeats : 105-249 

Herpes Simplex Virus Tyrosine Kinase promoter : 250-385 

Neomycin phosphotransferase coding region : 385-1183 

Herpes Simplex Virus Tyrosine Kinase Poly A signal : 1189-1249 

FRT : 1250-1310 

Linker sequences : 131 1-1340 
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Primer binding sites within mouse ferrochelatase cDNA 
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Primer binding sites within mouse a-l^GalT cDNA 



983fap cDNA spiic product 
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i) Ferrochelatase, FC-F/R 



M, Marker SFP-I 
C, MQW control 
K, KIDNEY 
H, HEART 
L, LIVER 



ii) a-y-GT cDNA spike 
+ 7F/9R2 primers 
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Mouse 15: Wild type; Mouse 20: Heterozygotc Gal KO; \foase 21: 

Homozygous Gal KO 
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T- LEF SEQUENCE - /%^/*<^ 

CTGACACCTTTCGCTTTCCTCTTGCGTCTCCGCCEGCGACCTTTCCCCACCCC 

ggcctctttcctggttgcaccacttcctctcattccaaaggattgtgccctta 
ctgctgctgctrtctgcactggaaacacggggcagcgagccctcttcccatcac 
ccctctaaatgccacctgtgccatacgccacccatgccacgccaacctc 

Met A^n Gin lie Lya Asa Gin Leu Ala Gin Leu Aan Gly 
ATG AAC CAG ATC AAG AAT CAA CTG GCA CAG CTC AAT GGG 

Ser Ala Acn Ala Leu Phe lie Ser Tyr Tyr Thr Ala Gin Gly 
AGC GCC AAT GCT CTC TTC ATT TCC TAT TAC ACA GCT CAA GGX 

Glu Pro Phe Pro Abo, Asa Val Glu Lys Lau Cys Ala Pro Asa 
GAG CCG TTT CCC AAC AAC GTG GAA AAG CTA TGT GCG CCT MC 

Met Thr Asp Phe Pro Sor Pha Kis Gly Asa Gly Thr Glu Lya 
ATG ACA GAC TTC CCA TCT TTC CAT GGC AAC GGG ACA GAG AAG 

Thr Lya Leu Val Glu Leu Tyr Arg Met Val Ala Tyr Leu Ser 
ACC AAG TTG GTG GAG CTG TAT CGG ATG GTC GCA TAC CTG AGC 

Ala Ser Leu Thr Asn He Thr Arg Asp Gin Lys Val Leu ash 
GCC TCC CTG ACC AAT ATC ACC CGG GAC CAG AAG GTC CTG AAC 

Thr Ala Val Ser Leu Gin Val Lys Leu Asn Ala Thr He 
CCC ACT GCC GTG AGC CTC CAG GTC AAG CTC AAT GCT ACT ATA 

Asp Val Met Arg Gly Leu Lau Ser Asn Val Leu Cys Arg Leu 

GAC GTC ATG AGG GGC CTC CTC AGC AAT GTG CTT TGC CGT CTG 

Cys Asn Lya Tyr Arg Val Gly His Val Ago Vol Pro Pro val 
TCC AAC AAG TAC CGT GTG GGC CAC GTG GAT GTG CCA CCT GTC 

?-o A30 Ilia Ser Asp Lya Glu Ala Phe Cln Arg Lys Lys Leu 
CCC GAC CAC TCT GAC AAA GAA GCC TTC CAA AGG AAA AAG TTG 

Gly Cys cla Leu Leu Gly Thr Tyr Lys cln val He Ser val 
GGT TGC CAG CTT CTG GGG ACA TAC AAG CAA GTC ATA AGT GTG 

Val Val Gin Ala Phe **■ 

GTG GTC CAG GCC TTC TAG AGAGGAGGTCTTGAATGTACCATGGACTC . . ♦ 




\ HUMAN T-LIF SSQUENCS 

GACCTTTTGC CTTTTCTCTC TCCTGGTGCA CCATTTCCTC TCCCTCCCTG 50 
AGCCGC-AGTT GTGCCCCTGC TGTTGGTTCT GCACTGGAAA CATGGGGCGG 100 
GGAGCCCCCT CCCCATCACC CCTGTCAACG CCACCTGTGC CATACGCCAC 150 
CCATGTCACA ACAACCTC ATG AAC CAG ATC 132 

Met Asn Gin He 

AGG AGC CAA CTG GCA CAG CTC AAT GGC AGT GCC AAT GCC CTC 227 
Arg Ser C-lr. leu Ala Gin. Leu Asn Gly Ser Ala Asn Ala Leu 
5 10 J- 5 

TTT AT? CTC TAT TAC ACA GCC CAG GGG GAG CCG TTC CCC AAC 272 
Phe He Leu Tyr Tyr Thr Ala Gin Gly Glu Pro ?he Pro Asn 

20 25 ' 30 

AAC CTG GAC AAG CTA TGT GGC CCC AAC GTG ACG GAC TTC CCG 317 
Asn Leu As? Lys Leu Cys Gly Pro Asn Val Thr Asp Phe Pro 

35 40 45 

CCC TTC CAC GCC AAC GGC ACG GAG AAG GCC AAG CTG GTG GAG. 362 
Pro Phe His Ala Asn Gly. Thr Glu Lys Ala Lys Leu Val Glu 

50 55 50 

CTG TAC CGC ATA GTC GTG TAC CTT GGC ACC TCC CTG GGC AAC 407 
Leu Tyr Arg He Val Val Tyr Leu Gly Thr Ser Leu Gly Asn 

65 70 
ATC ACC CGG GAC CAG AAG ATC CTC AAC CCC AGT GCC CTC AGC 452 
He Thr Arg Asp Gin Lys He Leu Asn Pr& Ser Ala Leu Ser 
75 80 35 

CTC CAC AGC AAG CTC AAC GCC ACC GCC GAC ATC CTG CGA GGC 497 
Leu Kis Ser Lys Leu Asn Ala Thr Ala Asp He Leu Arg Gly 

90 95 10° 

CTC CTT AGC AAC GTG CTG TGC CGC CTG TGC AGC AAG TAC CAC 542 
Leu Leu Ser Asn Val Leu Cys Arg Leu Cys Ser Lys Tyr Kis 

105 11° 115 

GTG GGC CAT GTG GAC GTG ACC TAC GGC CCT GAC ACC TCG GGT 537 
Val Gly Kis Val Asp Val Thr Tyr Gly Pro As? Thr Ser Gly 
120 125 130 

AAG GAT GTC TTC CAG AAG AAG AAG CTG GGC TGT CAA CTC CTG 
Lys As? Val Phe Gin Lys Lys Lys Leu Gly Cys Gin Leu Leu 



632 



V 135 140 

GGG AAG TAT AAG CAG ATC ATC GCC GTG TTG GCC CAG GCC TTC 677 
Gly Lys Tyr Lys Gin lie lie Ala Val Leu Ala Gin Ala Phe 
145 150 155 

TAG CAGGAGGTCT 7 22 
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BamHI Nae£ BamHI 
Xba ^ aci 



20X3 SV40 origin of replication 

M Adenovirus promoter 

vszm LDF T coding region 

g^?l Dihydrofolate reductase 3 # end 

annua Amptcillin resistance gene 

MB Bacterial origin of replication 
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